The Waterlogged project was envisioned as a means of investigation into how human activities affect water quality (specifically, organic matter concentration and composition). The hope of using citizen science would be to serve both educational and community engagement goals, as well as to increase the reach of sampling.
The Waterlogged project was designed as a contributing type of citizen science project, in which project aims and directions were designed by scientists for public participation. In the project, participants were told that the aim of the project was to ‘investigate how human activities affect water quality, specifically around the concentration and types of organic matter within surface waters like streams and rivers’. Given that organic matter is a less known water quality parameter, information regarding organic matter (what it is, and why it is important to investigate in terms of environmental and human health), was made available through conversation as well as on the project website.
Participants were given sample kits, which consist of acetic acid bathed brown PE bottle in a ziplock bag. This ziplock bag contains a small pencil; attached to the bag is the protocol for taking a sample, as well as the data sheet to be filled out (both written on waterproof paper.) A freezer pack was also included to keep the sample cool to prevent degradation.
The protocol for taking a sample was designed to be scientifically robust, as well as easy for people to follow. This included criteria for choosing a sample location, which included stipulation that the sample be from streams and rivers, and that it be actively flowing. The importance of safety in choosing a site and taking a sample was also emphasized; participants were told not to enter the water, and to watch for hazards such as slippery, wet surfaces. Prior to taking a sample, participants were told to rinse the bottle at least three times, to sample in a location that had not been disturbed, and to ensure that contamination of the bottle and sample did not occur. Participants were also asked to note critical information about the sample on the sample collection sheet. This included the date and time of collection, the GPS location if possible, and a description of the location and anything within the vicinity that could affect water quality. A sample description was also included to help orient participants. Participants were also asked to take a picture up, down, and across from where the sample was taken to help orient and provide data regarding sample surroundings.
Samples were analyzed for various water quality parameters once the sample was returned to the scientific team. These parameters included total suspended solids (TSS), nitrate concentration (NO3), dissolved organic carbon concentration (DOC), and a range of indices calculated from the absorbency and fluorescence spectra.
Skepticism regarding the quality of data collected by citizens remains a key concern for traditional scientific actors in accepting citizen science projects as adequately rigorous. Given these concerns, we used a paired sampling approach to compare data from samples collected by citizens to that of a pool of samples that scientifically trained individuals collected and analyzed. Because citizen sampling occurred at a range of different sites, we could not fully reproduce the spatial and temporal variability at which citizen sampling occurred. Given this, we attempted with our own sampling to sample sites within the local context that would be popular with sampling, as well as sample a range of sites to provide a snapshot of water quality characteristic of the region. The citizen-data pool comprised some 150 samples, and the our own data comprised some 110 samples.
Several main themes are explored using the Waterlogged case study.
What is an outlier?
How do citizens, and citizen-lead community groups, insert their own interests into data collection itself?
Contradictory questions that matter to scientists versus citizens (investigating long term trends versus effects of a specific event).
Place centrism of data collection for citizens – you can’t possibly meet citizen standards, given their placement within the context of sampling
Data might be primary for scientists, but secondary for citizens whose aim is education.
Aberrations that occur but are within normal distribution but are interpreted as more meaningful by citizens due to context of collection.
Lesson learned: 5) Data might be primary for scientists, but secondary for citizens whose aim is education
Sample Description:
Evergreen is a national environmental not-for-profit whose participation within the Waterlogged project coincides with their Uncover Your Creeks citizen science project. This program focuses on engagement and education, directly involving citizens in the restoration, rehabilitation and monitoring of streams within the urban Metro Vancouver context.
Evergreen took dim(ES)[1] samples during the Waterlogged tenure within sites that they use for their Uncover Your Creeks program, including the the Serpentine River, Wagg Creek, Chubb Creek, Still Creek and Jericho Pond (shown as blue dots).
Samples within the Evergreen subset tended to have higher DOC and NO3 than the average in both the citizen and scientist subsets. This could be attributed to the sample sites in which Evergreen has focused on, noting that they tend to be highly impacted by urban activities (such as the Still Creek and Chubb Creek sites), and thus may have inflows of storm water laden with NO3 and DOC from urban sources. Additionally, the greatest DOC concentrations correspond top samples from the Jericho Pond site, a man-made pond that suffers heavily from eutrophication, especially within the summer months in which sampling occurred.
Lessons Learned:
Lesson Learned: 2. How do citizens, and citizen-lead community groups, insert their own interests into data collection itself? 6) Aberrations that occur but are within normal distribution but are interpreted as more meaningful by citizens due to context of collection
Sample Details:
The Silver Creek Streamkeepers are a small monitoring group tasked with monitoring Silver Creek, a small urban stream whose headwaters is located within Burnaby Mountain/SFU;it flows south, through residential areas in its norther reaches and increasingly industrial lands until it flows into the Brunette River, and ultimately Burnaby Lake. The creek is home to various species of fish (including endangered Nooksack Dace, coho and chum salmon and cutthroat trout), and other fauna such as turtles, salamanders, beavers, coyotes and others. The streamkeeper groups is involved in monitoring, as well as restoration and salmon release activities.
Participation within the Waterlogged project was prompted partially out of a desire for increase data regarding water quality in general, as well as concern over a coal train derailment. This derailment occurred in the lower reaches of Silver Creek, just upstream of where the stream enters Burnaby Lake, where Silver Creek emerges from a culvert beneath a Costco parking lot. Three Canadian National (CN) cars tipped into the stream, spilling 270 tonnes of metallurgical coal (a smaller percentage of which entered the stream, according to the Ministry of Environment) on January 11, 2014. It was estimated that this resulted in 5.5 cubic meters of coal in Silver Creek, and 76 cubic meters of coal in downstream Burnaby Lake (noting that heavy rains at the time of the derailment transported a large amount of coal downstream from the derailment). The cause of the derailment was attributed to heavy rains that caused a beaver dam breach upstream of the tracks, causing the stream to overflow tracks close to Brighton Avenue and Government Street in Burnaby, causing seven cars to derail and three cars containing coal to ultimately tip into the stream.
Initially, the ecological impact of the coal spill was estimated to be low, with arising within animals breathing coal sediment through lungs or gills. It was expected that the spill would have with low water quality impacts. However, initial reports regarding the impact of the spill were deemed “deficient in addressing the conditions, environment, biota, and potential concerns in the impacted water bodies,” according to a Ministry of Environment memo dated February 6, 2014 (released on April 16, 2014). At this time, the Ministry requested that CN submit detailed information about the amount of coal spilled and recovered following the accident, and the environmental consequences and fate of the coal remaining in the creek and downstream lake. CN was also required to submit and undertake cleanup and protective measures on the site in an attempt to remediate and protect against future spills; such protective measures included streamside buffers and setbacks, reinforcing stream banks, removing hazardous trees, culvert cleaning, and goals towards protection of downstream water and sediment quality. Remediation and removal of coal within the stream and lake was finished by April 15, 2014, and was primary accomplished by suction dredging , where the presumed coal-water mixture was suctioned through filters to remove coal and sediments, returning filtered water to the stream. After remediation, the company stated that they met provincial cleanup criteria, and that a ‘minuscule’ amount of coal was left, but could not specify exact figures regarding residual amounts.
The Silver Creek Streamkeepers took five samples in a transect from stream headwaters to its intersection with the Brunette River in fall, 2014. All of these samples demonstrated higher than average NO3 concentrations (Figure). In addition, the samples upstream of the coal derailment (in blue), had a slightly lower concentration of DOC as compared to samples taken downstream of the derailment (in black). Additionally, we noted the presence of a unique peak within the fluorescence spectra of samples originating downstream of the derailment (in Supplemental Information).
Lessons Learned:
The dominant reason behind participation within the Waterlogged project was to help gain an understanding of how the coal trail derailment had affected the ecosystem (noting that the Municipality of Burnaby suggested their participation knowing of these concerns). Other concerns regarding watershed issues that could affect water quality were also expressed. This included the dominate industrial uses of the watershed itself, significant recent construction just upstream of the derailment and sampling site, deforestation associated with a pipeline assessment, exposed peat within the channel (noting that peat farming was a past land use), and creosote-soaked wooden ties within the stream at the site of the derailment. These interests shaped where samples were taken (noting the transect in which samples were taken), and may shape interpretations of the data itself. Although sites downstream of the derailment site show unique features discussed above, it is nearly impossible to say whether the slight changes in water quality are firstly significant, and secondly derive from the coal train derailment itself. Understanding the impact of derailment within this complex milieu would require a specific research program that would comprise a significantly greater degree of sampling. Additionally, while the methods we used to analyze organic matter composition have been used in a limited way to illuminate possible oil contamination, other analytic approaches (such as determination of hydrocarbon and metal concentrations) would likely be more direct.Thus, because of limitations arising from both sampling as well as a methodology that was not quite tailored to the specific concerns, speculations as to the water quality effects of both the specific spill event, as well as that deriving from watershed conditions, is merely speculation given the data produced within the context of the Waterlogged investigation. Given the expectations and interests brought to sampling, such a conclusion was an unsatisfying conclusion to a compelling story regarding water quality and impacts within our local context.
References (regarding the derailment): http://www.vancouversun.com/launches+cleanup+coal+dumped+January+derailment+Burnaby/9584019/story.html http://www.vancouversun.com/Coal+spill+feared+risk+aquatic+life+Burnaby+Silver+Creek/9406236/story.html http://www.straight.com/news/629231/cn-report-burnaby-coal-derailment-deficient-lacks-details-environmental-impacts http://www.burnabynow.com/news/cn-finishes-coal-spill-cleanup-1.963761#sthash.1w2efbBC.dpuf
Lesson Learned: 2) How citizens insert their own interests into data collection itself; 3. Contradictory questions that matter to scientists versus citizens (investigating long term trends versus effects of a specific event).
Sample Description:
Staff from the Municipality of Bowen Island were interested in participating within Waterlogged given concerns over the effect that possibly leaking septic fields were having on a stream-fed lagoon. This area has seen much recent development of residential homes around the lagoon itself without the development of an expensive wastewater system; thus, new and existing developments use septic systems. Municipal staff were concerned about the aggregate effects of possible septic contamination into the lagoon, and were interested in any preliminary water quality data that could help determine whether more intensive study of the site is necessary.
Municipal staff took four total samples in fall of 2015. One sample was taken in the stream entering the lagoon, prior to residential development, and was thus considered by the samplers to be a control. This ‘control’ showed lower concentrations of both NO3 and DOC when compared to the lagoon samples. In particular, a sample taken at the north side of the lagoon (considered to be a likely candidate by the participants for septic contamination) did demonstrate the highest NO3 concentrations of an sample within the sample set. This sample also had high levels of TSS (dark filter shown in Supplemental Information).
Lessons Learned:
Although sewage contamination can increase NO3 concentrations, other factors, such as fertilizers from lawns and gardens, can also increase concentrations. A more direct way of testing for septic field infiltration would be bacterial testing to test for the presence of fecal coliforms. Thus, participation within the project lent some gravity to concerns about septic field contamination, with the provision that the specific question posed by the municipality would be best answered using other analytic methods and a more in-depth sampling regime.
Thus, while the Waterlogged project can suggest that this sample might be affected by septic field contamination, the definitive presence of fecal coliforms (which would pinpoint such contamination), is beyond the ability of this type of analysis to determine. This speaks to the difficulty of reconciling the motivations that citizens bring to sampling, as well as the fact that questions that matter to citizens may differ from those the scientist sets out to examine.
Lesson Learned: 2) How citizens insert their own interests into data collection itself; 6) Aberrations that occur but are within normal distribution but are interpreted as more meaningful by citizens due to context of collection
Sample Description:
The North Shore Wetland Partners is a volunteer run community organization founded in 2004 and incorporated in 2010 whose mandate is “to promote conservation of North Shore wetlands through education, research, restoration and stewardship.” Their interest in participating within Waterlogged was to augment water quality and ecosystem monitoring and restoration activities they undertake throughout Vancouver’s North Shore, focused somewhat on natural and man-made wetlands, but also ecological health of aquatic systems generally (including the Seymour River).
The NSWP took some 12 samples from ** of sample sites across that North Shore that, to the best of our knowledge, comprise their own research and sample sites. From the Waterlogged data, 9 out of 12 samples exhibited higher than average DOC concentrations, and eight out of 12 sites showed higher than average NO3 concentrations.
Lessons Learned:
In addition to concerns about urbanization within watersheds sampled, the NWSP expressed concerns to us that one site in particular had been affected by clear cutting (which occurred in the late 2000s) which they considered to be illegal. They expressed concerns through returned data sheets that samples taken within the vicinity of this activity suffered from unusually high concentrations of iron (indeed, in certain instances the sample data sheet was stained with the characteristic orange colour of iron oxides). Given that iron analysis of beyond the scope of the work, we could not assess in a quantitative way whether iron was particularly high in these (or any) samples. Additionally, we note that iron can be characteristic of the iron-rich podzol that dominates much of the local soils, where it would be difficult to link any loss of forest cover to increased iron. However, we also note that high concentrations of iron gives streams and rocks a vivid orange hue; because the NSWP knows the historical precedence of the watershed, this could serve as a dramatic visual cue that water quality is impaired within a context in which they are aware that disruption has taken place.
Lesson Learned: 1) What is an outlier?; 4) Place centrism of data collection for citizensm
Sample Description:
The North Shore Streamkeeprs, a streamkeeping group central to the operation of the Pacific Streamkeepers Organization, monitors the health of several watersheds around North Vancouver. They are a unique organization in that they have developed (in collaboration with the Department of Fisheries and Oceans) a in-depth scientific protocol for monitoring that has been adopted by numerous local streamkeeping group. They extensive volunteer activities are coordinated by an exceptional community leader within the organization. This group expressed concerns about urbanization effects in general (especially on fish-bearing streams), as well as previous study they conduction which showed that sewage contamination can occur within their sampling sites due to improperly or illegally connected sewer systems from houses.
Samples from this organization tended to have lower than average DOC concentrations, but be slightly elevated in NO3 when compared to the rest of the data.
Lessons Learned:
What is striking about samples from this organization is the presence of a high DOC concentration sample, noted in red in the figure below. This sample was taken from Hastings Creek, downstream of a construction site. The person sampling is highly involved in watershed activities within the North Shore, and noted a large sediment plume originating from a culvert leading from the construction site. This on the ground timeliness allowed the individual to take an immediate sample. Upon analysis, we noted that the sample was opaque with sediment, much of which persisted despite filtering with a 70 micron filter. Because of the nature of our optical-based measurement of DOC, this fine suspended sediment contributed to a DOC concentration measurement of 13.67 mg/L - the fourth highest DOC measurement across all data (an inflated number based on the interference of the sediment present). Thus, the capacity of citizen samplers, especially the community champions who are actively involved, can provide samples and data regarding events that would occur at a timescale that would be difficult for others to capture, given the need to realize the issue, gather equipment, and get to the site. This sample also questions the concept of an outlier within the context of the data sampled- as noted, this sample is one of the highest DOC concentrations noted due to interference. However, contextual data about the circumstances of sampling contextualizes what could otherwise be considered a poor or overinflated data point.
Pacific Streamkeepers Links: -Taccogna, G. and K. Munro (eds). 1995. The Streamkeepers Handbook: a Practical Guide to Stream and Wetland Care. Salmonid Enhancement Program, Dept. Fisheries and Oceans, Vancouver, BC. - http://www.pskf.ca/publications/Handbook%20and%20Modules.pdf
Lesson Learned: 1) What is an outlier? 2) How citizens insert their own interests into data collection itself; 3) Contradictory questions that matter to scientists versus citizens (investigating long term trends versus effects of a specific event).
Sample Description:
The sample taken the furthest outside of the Metro Vancouver region corresponded to a sample taken on the Quesnel River, taken three days after a major dam breech spilled mine effluent into the Quesnel Lake and River. A drinking water ban was immediately placed on these water sources, though subsequent water quality testing would show that water was acceptable in terms of provincial drinking water guidelines (noting that elevated concentrations of metals including selenium, arsenic, copper, and vanadium were found within sediment, with unknown long term environmental effects within affected environments).
From the description taken by the person sampling: “Dock near boats. summertime. Clear morning around 24degC. Three days after major environmental disaster at Polley Mountain Mine. Lake and surrounding water ways have floating debris + government water ban.”
Data from this single sample was not remarkable in either DOC or NO3 concentrations - indeed, the sample showed lower than average DOC and NO3 concentrations.
Lessons Learned:
This sample again speaks to the motivations that underlie intentions to sample. In this case, the occurrence of a major, well publicized event spurred the participant to take a sample, despite the incompatibility of our research goal and correspondent method at investigating the likely effects on water quality from this major event.
This sample also showcases that water quality is a matter of lenses, speaking also to the question of what is an outlier. This sample is clearly not an outlier by measure of either sample set; yet, this watershed had clearly experienced an event that mine safety experts have called one of the largest in Canadian history (noting that the sample was taken only three days after the mine dam breached). In this case, this exceptional sample shows simply as an ‘in-lier’ within the data distribution, belying the circumstances under which it was collected.
Lesson Learned: Community sampling as a response to a specific event; Limits of methodology; Difficulty of determining effects from specific events or conditions within generalized science plan; Sophistication of community resources and existing activities
Sample Description:
Lawson Creek is a small urban creek that runs from Cypress Mountain south to the Burrard Inlet, through highly residential areas and a man-made wetland. Both the Pacific Wetland Partners and the West Vancouver Streamkeepers have been involved within long term monitoring, restoration, and fish release within the creek.
In addition to general concerns regarding consequences of urban development on ecological health within this watershed, both groups have undertaken an environmental assessment to explain why a large percentage of released fish died. This fish kill was attributed to stress during transport; the streamkeeping groups expressed doubt as to whether transport stress killed the fish, as they died several weeks after introduction, but could not otherwise attribute the deaths based on the water quality monitoring they had done. In their words,“I think we just don’t know why the fish were killed.”"
In terms of water quality parameters observed through Waterlogged participation, five total samples were taken lower in the reach of the stream, where most of the monitoring activities have to date taken place. None of the samples were remarkable in terms of their DOC or NO3 concentrations, noting that neither would have toxic effects to the fish at high concentrations.
Lessons Learned:
Both the sampling for our project, as well as subsequent sampling occurring after the fish mortality event, speaks to the nature of all grab and point sampling as a snapshot in time. Even with the increased capacity of citizens to reach and sample a location within their local context, it is possible that if the fish deaths were related to a water quality issue, the timescale at which this even may have occurred may have been too short to sample. Thus, it is impossible to completely rule out the occurrence of a short-time period water quality event that contributed to fish mortality, despite the effort and resources dedicated to testing that showed unremarkable results for a range of water quality metrics. Because the citizens monitoring the stream have the ability to mobilize sampling at a much shorter time period than professional scientists, what does this say about our ability to investigate destructive events that occur at short time periods, especially ones occurring within locations more remote than an easily accessed urban stream?
To look at the difference between citizen data and our own, we use histograms and cumulative distribution functions to look at density of data within both data sets around mean values.
Expressed as histograms, with our data in one and the citizen data in another (for DOC concentrations only):
Lastly, can also look at the cumulative distribution function to see shifts in the data between the two sets. See the evidence of a ‘tail’ in the WL data, showing contribution of higher DOC samples.
The first statistical test for significant differences is a t-test to look at whether means are significantly different between the two groups.
##
## Welch Two Sample t-test
##
## data: DOC_mgL by code
## t = -0.61409, df = 245.53, p-value = 0.5397
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -1.0350437 0.5430393
## sample estimates:
## mean in group DBP mean in group WL
## 2.937426 3.183428
##
## Welch Two Sample t-test
##
## data: NO3mgL by code
## t = -3.3725, df = 187.59, p-value = 0.0009047
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.4380607 -0.1147217
## sample estimates:
## mean in group DBP mean in group WL
## 0.1619762 0.4383674
##
## Welch Two Sample t-test
##
## data: e2e3 by code
## t = 5.6275, df = 147.6, p-value = 8.92e-08
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 0.5152458 1.0729640
## sample estimates:
## mean in group DBP mean in group WL
## 5.323957 4.529852
##
## Welch Two Sample t-test
##
## data: e4e6 by code
## t = 2.2597, df = 103.3, p-value = 0.02594
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 0.6954047 10.6690722
## sample estimates:
## mean in group DBP mean in group WL
## 10.42242 4.74018
##
## Welch Two Sample t-test
##
## data: FI by code
## t = -7.9388, df = 203.87, p-value = 1.345e-13
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.15263530 -0.09190216
## sample estimates:
## mean in group DBP mean in group WL
## 1.418502 1.540771
##
## Welch Two Sample t-test
##
## data: HIX_ohno by code
## t = 5.182, df = 221.9, p-value = 4.926e-07
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 0.04812629 0.10719400
## sample estimates:
## mean in group DBP mean in group WL
## 0.8971810 0.8195209
##
## Welch Two Sample t-test
##
## data: BIX by code
## t = -4.4366, df = 203.15, p-value = 1.497e-05
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.15189578 -0.05842564
## sample estimates:
## mean in group DBP mean in group WL
## 0.5418329 0.6469936
##
## Welch Two Sample t-test
##
## data: peakA by code
## t = -5.5607, df = 168.93, p-value = 1.032e-07
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -1.9290545 -0.9182321
## sample estimates:
## mean in group DBP mean in group WL
## 0.6847251 2.1083685
##
## Welch Two Sample t-test
##
## data: peakC by code
## t = -5.0503, df = 173.19, p-value = 1.111e-06
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.9339636 -0.4090765
## sample estimates:
## mean in group DBP mean in group WL
## 0.3217908 0.9933108
##
## Welch Two Sample t-test
##
## data: peakT by code
## t = -3.9648, df = 174.19, p-value = 0.0001071
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.21844186 -0.07324288
## sample estimates:
## mean in group DBP mean in group WL
## 0.09851948 0.24436186
##
## Welch Two Sample t-test
##
## data: peakB by code
## t = -5.6172, df = 159.55, p-value = 8.436e-08
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.2346397 -0.1125667
## sample estimates:
## mean in group DBP mean in group WL
## 0.07616346 0.24976667
##
## Welch Two Sample t-test
##
## data: redox by code
## t = 1.5228, df = 227.55, p-value = 0.1292
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.004681242 0.036526444
## sample estimates:
## mean in group DBP mean in group WL
## 0.4212347 0.4053121
##
## Welch Two Sample t-test
##
## data: perprotein by code
## t = -1.9307, df = 202.33, p-value = 0.05492
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -3.55069829 0.03737825
## sample estimates:
## mean in group DBP mean in group WL
## 4.693979 6.450639
Shown are the filters after sampling the Bowen Island samples. On the far left is the ‘control’ sample; the sample with high NO3 corresponds to the filter with the greatest amount of sediment (middle right filter).